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RAW SEQUENCE LISTING DATE: 02/14/2002 

PATENT APPLICATION: US/10/010,408 TIME: 19:09:10 

Input Set : N:\Crf3\RULE60\10010408.txt 
Output Set: N:\CRF3\02142002\J010408.raw 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: John J. Castellot, Jr. 
(ii) TITLE OF INVENTION: Novel Heparin -Induced CCN-Like Molecules 

and Uses Therefor 
(iii) NUMBER OF SEQUENCES: 13 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD, LLP 

(B) STREET: 28 State Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02109 
(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.2 5 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/10/010,408 

(B) FILING DATE: 07-Dec-2001 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/044,273 ' - 

(B) FILING DATE: March 19, 1998 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Amy E. Mandragouras 

(B) REGISTRATION NUMBER: 36,207 

(C) REFERENCE/DOCKET NUMBER: MBI-004 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)742-4214 

(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 
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RAW SEQUENCE LISTING DATE: 02/14/2002 

PATENT APPLICATION: US/10/010,408 TIME: 19:09:10 

Input Set : N:\Crf3\RULE60\10010408.txt 
Output Set: N:\CRF3\02142002\J010408.raw 

65 (B) LOCATION: 249.. 1001 

68 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

70 GACGCTTCTG ATCTCCAGAG GACCCTGGGG TGGGACAGGG GCCTTGGCAA GGCTGCAGCC 60 
72 GCTGGGCAGT GGCTTGGAAT GGAGGTCTTT ATTACTGGGA ACTGAGGAGC TAAGAGGCTC 120 
74 CTGTCAGCTT GTCCTAAAGT CTTAGCACTT GTGGTGGCTT GGGCTTCACA CACTGTCAGA 180 
76 CACCTTCGTG GTGGCCTCCA CGGCCTCACC TTCAGGTTTG AAGCTGGCTC CACAAGGGAC 240 

78 ACGGTGAC ATG AGG GGC AGC CCA CTG ATC CAT CTT CTG GCC ACT TCC TTC 290 

79 Met Arg Gly Ser Pro Leu lie His Leu Leu Ala Thr Ser Phe 

80 1 5 10 

82 CTC TGC CTT CTC TCA ATG GTG TGT GCC CAG CTG TGC CGG ACA CCC TGT 338 

83 Leu Cys Leu Leu Ser Met Val Cys Ala Gin Leu Cys Arg Thr Pro Cys 

84 15 20 25 30 

86 ACC TGT CCT TGG ACA CCA CCC CAG TGC CCA GAG GGG GTA CCC CTG GTG 386 

87 Thr Cys Pro Trp Thr Pro Pro Gin Cys Pro Gin Gly Val Pro Leu Val 

88 35 40 45 

90 CTG GAT GGC TGT GGC TGC TGT AAA GTG TGT GCA CGG AGG CTG GGG GAG 4 34 

91 Leu Asp Gly Cys Gly Cys Cys Lys Val Cys Ala Arg Arg Leu Gly Glu 

92 50 55 60 

94 TCC TGC GAC CAC CTG CAT GTC TGC GAC CCC AGC CAG GGC CTG GTT TGT 4 82 

95 Ser Cys Asp His Leu His Val Cys Asp Pro Ser Gin Gly Leu Val Cys 

96 65 70 75 

98 CAG CCT GGG GCA GGC CCT GGC GGC CAT GGG GCT GTG TGT CTC TTG GAT 530 

99 Gin Pro Gly Ala Gly Pro Gly Gly His Gly Ala Val Cys Leu Leu Asp 

100 80 85 90 

102 GAG GAT GAC GGT AGC TGT GAG GTG AAT GGC CGC AGG TAC CTG GAT GGA 578 

103 Glu Asp Asp Gly Ser Cys Glu Val Asn Gly Arg Arg Tyr Leu Asp Gly 

104 95 100 105 110 

106 GAG ACC TTT AAA CCC AAT TGC AGG GTC CTG TGC CGC TGT GAT GAC GGT 626 

107 Glu Thr Phe Lys Pro Asn Cys Arg Val Leu Cys Arg Cys Asp Asp Gly 

108 115 120 125 

110 GGC TTC ACC TGC CTG CCG CTG TGC AGT GAG GAT GTG CGG CTG CCC AGC 674 

111 Gly Phe Thr Cys Leu Pro Leu Cys Ser Glu Asp Val Arg Leu Pro Ser 

112 130 135 140 

114 TGG GAC TGC CCA CGC CCC AAG AG A ATA CAG GTG CCA GGA AAG TGC TGC 722 

115 Trp Asp Cys Pro Arg Pro Lys Arg lie Gin Val Pro Gly Lys Cys Cys 

116 145 150 155 

118 CCC GAG TGG GTA TGT GAC CAG GGA GTG ACA CCG GCG ATC CAG CGC TCC 770 

119 Pro Glu Trp Val Cys Asp Gin Gly Val Thr Pro Ala lie Gin Arg Ser 

120 160 165 170 

122 ACG GCG CAA GGA CAC CAA CTT TCT GCC CTT GTC ACT CCT GCC TCT GCT 818 

123 Thr Ala Gin Gly His Gin Leu Ser Ala Leu Val Thr Pro Ala Ser Ala 

124 175 180 185 190 

126 GAT GCT CCT TGT CCA AAT TGG AGC ACA GCC TGG GGC CCC TGC TCA ACC 866 

127 Asp Ala Pro Cys Pro Asn Trp Ser Thr Ala Trp Gly Pro Cys Ser Thr 

128 195 200 205 

130 ACC TGT GGG CTG GGC ATA GCC ACC CGA GTG TCC AAC CAG AAC CGA TTC 914 

131 Thr Cys Gly Leu Gly lie Ala Thr Arg Val Ser Asn Gin Asn Arg Phe 

132 210 215 220 

134 TGC CAA CTG GAG ATC CAA CGC CGC CTG TGT CTG CCC AGA CCC TGC CTG 962 
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RAW SEQUENCE LISTING DATE: 02/14/2002 

PATENT APPLICATION: US/10/010 ^ 408 TIME: 19:09:10 

Input Set : N:\Crf3\RULE60\10010408.txt 
Output Set: N:\CRF3\02142002\J010408.raw 

135 Cys Gin Leu Glu lie Gin Arg Arg Leu Cys Leu Pro Arg Pro Cys Leu 

136 225 230 235 

138 GCA GCC AGG AGC CAC AGC TCA TGG AAC AGT GCT TTC TAAGGCCAAC 1008 

139 Ala Ala Arg Ser His Ser Ser Trp Asn Ser Ala Phe 

140 240 ^ 245 250 

142 TGGGGATGCG GATACAGGGC CTGCCATCCT CAGCAAATGA CCCTAGGACC AGGCCCTGGA 1068 
144 CTGCTGGTAG ATGCTCTTCT CCATGCTCTT GGCTGCAGTT AACTGTCCTG CTTGGATTCA 1128 
14 6 CTGTGTAGAG CCACTGAGCG ATCCCTGCTC TGTCTGAGGT AGGCGGAGCA GGTGACCAGC 1188 
14 8 TCCAGTTCTC TGGTTCAGCC TGGAATTCTG GGTTCTCCTG GCTCATTCCT CAAAACATCC 124 8 

150 CTGTACAAAA AGGACAACCA AAAAGACCTT TAAACCTAGG CTATACTGGG CAAACCTGGC 1308 
152 CACCGTGCTG GGGATAAGGT CAATGTTAGG ACCAGACAGC AGATTGCCTG AAACTTCCAA 1368 
154 TTCCCTTCTT GGACTTCTGT ATGCTTGTCC CCAAAGATGA TGAATGAACT CGTAAGTGTA 1428 
156 CCTTCCCTGA CCTGAGAACA CCCTGCCTGC TCGGGAAGTA TTCAGGGGCA GAATTCTCTG 1488 
158 TGAACATGAA GAGATGAATC ACACTGTCCT TAAGAAATTC CTGAAAGTCC AGGAACTTGA 1548 
160 GCTTTGTATT TTCAGGAATG CACATCTCTT AAGCACTCGC AAAACAGGAA GGCTCCACAC . 1608 
162 CTCTGGCAGG CCAGGGCCTT TCTCTTCAGC ATGAGAAAGA CAAGGGACAG CAGAGTACTC 16 68 
164 TCCTCTGGAG GACTAGTCTA GCCTAGAATA AACACCCAAA 1708 
167 (2) INFORMATION FOR SEQ ID NO: 2: 

169 (i) SEQUENCE CHARACTERISTICS: 

170 (A) LENGTH: 250 amino acids 

171 (B) TYPE: amino acid 

172 (D) TOPOLOGY: linear 
174 (ii) MOLECULE TYPE: protein 

176 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

178 Met Arg Gly Ser Pro Leu lie His Leu Leu Ala Thr Ser Phe Leu Cys 

179 15 10 15 

181 Leu Leu Ser Met Val Cys Ala Gin Leu Cys Arg Thr Pro Cys Thr Cys 

182 20 25 30 

184 Pro Trp Thr Pro Pro Gin Cys Pro Gin Gly Val Pro Leu Val Leu Asp 

185 35 40 45 

187 Gly Cys Gly Cys Cys Lys Val Cys Ala Arg Arg Leu Gly Glu Ser Cys 

188 50 55 60 " 

190 Asp His Leu His Val Cys Asp Pro Ser Gin Gly Leu Val Cys Gin Pro 

191 65 70 75 80 

193 Gly Ala Gly Pro Gly Gly His Gly Ala Val Cys Leu Leu Asp Glu Asp 

194 85 90 95 

196 Asp Gly Ser Cys Glu Val Asn Gly Arg Arg Tyr Leu Asp Gly Glu Thr 

197 100 105 110 

199 Phe Lys Pro Asn Cys Arg Val Leu Cys Arg Cys Asp Asp Gly Gly Phe 

200 115 120 125 

202 Thr Cys Leu Pro Leu Cys Ser Glu Asp Val Arg Leu Pro Ser Trp Asp • 

203 130 , 135 140 

205 Cys Pro Arg Pro Lys Arg lie Gin Val Pro Gly Lys Cys Cys Pro Glu 

206 145 150 155 160 

208 Trp Val Cys Asp Gin Gly Val Thr Pro Ala lie Gin Arg Ser Thr Ala 

209 165 170 175 

211 Gin Gly His Gin Leu Ser Ala Leu Val Thr Pro Ala Ser Ala Asp Ala 

212 180 185 190 • 

214 Pro Cys Pro Asn Trp Ser Thr Ala Trp Gly Pro Cys Ser Thr Thr Cys 
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RAW SEQUENCE LISTING DATE: 02/14/2002 

PATENT APPLICATION: US/10/010,408 TIME: 19:09:10 

Input Set : N:\Crf3\RULE60\10010408.txt 
Output Set: N:\CRF3\02142002\J010408.raw 
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SEQUENCE CHARACTERISTICS: 
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SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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RAW SEQUENCE LISTING DATE: 02/14/2002 

PATENT APPLICATION: • US/IO/010 , 408 TIME: 19:09:10 

Input Set : N:\Crf3\RULE60\10010408.txt 
Output Set: Nj\CRF3\02142002\J010408.raw 

284 Trp Val Cys Asp Gin Gly Val Thr Pro Ala lie Gin Arg Ser Thr Ala 

285 165 170 175 

287 CAA GGA CAC CAA CTT TCT GCC CTT GTC ACT CCT GCC TCT GCT GAT GCT 576 

288 Gin Gly His Gin Leu Ser Ala Leu Val Thr Pro Ala Ser Ala Asp Ala 

289 180 185 190 

291 CCT TGT CCA AAT TGG AGC ACA GCC TGG GGC CCC TGC TCA ACC ACC TGT 624 

2 92 Pro Cys Pro Asn Trp Ser Thr Ala Trp Gly Pro Cys Ser Thr Thr Cys 
293 195 200 ' 205 

295 GGG CTG GGC ATA GCC ACC CGA GTG TCC AAC CAG AAC CGA TTC TGC CAA 672 

296 Gly Leu Gly lie Ala Thr Arg Val Ser Asn Gin Asn Arg Phe Cys Gin 

297 210 215 220 

299 CTG GAG ATC CAA CGC CGC CTG TGT CTG CCC AG A CCC TGC CTG GCA GCC 720 

300 Leu Glu lie Gin Arg Arg Leu Cys Leu Pro Arg Pro Cys Leu Ala Ala 

301 225 230 235 240 

303 AGG AGC CAC AGC TCA TGG AAC AGT GCT TTC TAA 753 

304 Arg Ser His Ser Ser Trp Asn Ser Ala Phe 

305 245 250 
308 (2) INFORMATION FOR SEQ ID NO : 4 : 

310 (i) SEQUENCE CHARACTERISTICS: 

311 (A) LENGTH: 8 amino acids 

312 (B) TYPE: amino acid 

313 (D) TOPOLOGY: linear 
315 (ii) MOLECULE TYPE: peptide 
317 (V) FRAGMENT TYPE: internal 

321 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

W--> 323 Gly Cys Gly Cys Cys Xaa Xaa Cys 

324 1 5 

32 6 (2) INFORMATION FOR SEQ ID NO : 5: 

328 (i) SEQUENCE CHARACTERISTICS: 

329 (A) LENGTH: 177 base pairs 

330 (B) TYPE: nucleic acid 

331 (C) STRANDEDNESS : single 

332 (D) TOPOLOGY: linear 
334 (ii) MOLECULE TYPE: cDNA 

337 (ix) FEATURE: 

338 (A) NAME/KEY: CDS 

339 (B) LOCATION: 1..177 

342 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

344 TGT GAG GTG AAT GGC CGC AGG TAC CTG GAT GGA GAG ACC TTT AAA CCC 48 

34 5 Cys Glu Val Asn Gly Arg Arg Tyr Leu Asp Gly Glu Thr Phe Lys Pro 
346 15 10 15 

34 8 AAT TGC AGG GTC CTG TGC CGC TGT GAT GAC GGT GGC TTC ACC TGC CTG 96 

349 Asn Cys Arg Val Leu Cys Arg Cys Asp Asp Gly Gly Phe Thr Cys Leu 

350 20 25 30 

352 CCG CTG TGC AGT GAG GAT GTG CGG CTG CCC AGC TGG GAC TGC CCA CGC 144 

353 Pro Leu Cys Ser Glu Asp Val Arg Leu Pro Ser Trp Asp Cys Pro Arg 

354 35 40 45 

356 CCC AAG AGA ATA CAG GTG CCA GGA AAG TGC TGC 177 

357 Pro Lys Arg lie Gin Val Pro Gly Lys Cys Cys 
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VERIFICATION SUMMARY DATE: 02/14/2002 

PATENT APPLICATION: US/10/010,408 TIME: 19:09:11 

Input Set : N:\Crf3\RULE60\10010408.txt 
Output Set: N:\CRF3\02142002\J010408.raw 

L:28 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
L:29 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:323 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 
L:399 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 7 
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